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Abstract 

QWe evaluate the expected measurement accuracy of the branching ratio of the Standard Model Higgs boson decaying 
into tau lepton pairs at the ILC with a full simulation of the ILD detector concept. We assume a Higgs mass of 125 
GeV, a branching ratio of BR (h —> t + t~) = 6.32 %, a beam polarization of electron (positron) of — 0.8(+0.3), and an 
integrated luminosity of 250 fb _1 . The Higgs-strahlung process e + e“ —> Zh with Z —> qq is analyzed. We estimate 
the measurement accuracy of the branching ratio A(cr x BR)/(cr x BR) to be 3.4% with using a multivariate analysis 
technique. 

1 Introduction 

The Higgs boson has been found at the LHC by the ATLAS experiment [1] and the CMS experiment [2]. After that, 
understanding the properties of a Higgs boson through the precise measurement is important for particle physics. A 
branching ratio (BR) is one of the most important properties of a Higgs boson. In the Standard Model, the Yukawa 
coupling of matter fermions with a Higgs boson is completely proportional to the fermion mass. However, the Yukawa 
coupling will deviate from the prediction of the Standard Model if there are new physics. The pattern of deviation depends 
on the new physics model. Thus, understanding a Higgs boson is very important from the viewpoint of new physics. 

Besides, the size of the deviation is expected to be small if the scale of new physics is high. Specifically, the allowed 
deviation can be at the few-percent level even if no additional new particles are to be found at the LHC [3]. Since the 
branching ratio measurement is used as an input in the extraction of the Yukawa coupling, a precise determination of the 
branching ratio is essential to probe new physics. 

In this study, we focus on the branching ratio of the Higgs boson decaying into tau lepton pairs. The mass of the tau 
lepton is known to a very good precision unlike quarks, which typically suffer from the theoretical uncertainties arising 
from QCD. Also, the deviation in the lepton Yukawa coupling could well differ from the quark Yukawa coupling, such as 
in the lepton-specific Two-Higgs Doublet Model. Thus, the tau Yukawa coupling is an ideal probe for new physics. 

In this study, we estimate the measurement accuracy A (a x BR)/(cr x BR) of the h —> t + t~ branching ratio at the 
center-of-mass energy of 250 GeV and 500 GeV at the ILC with the ILD full detector simulation. In this proceedings, we 
will discuss the 250 GeV case, based on our talk at the LCWS14 [4]. 


2 Signal and Background 

The Feynman diagrams of the Higgs boson production processes are shown in Figure [I] The Higgs-strahlung process 
(e + e _ —> Zh) is the most dominant production processes at the center-of-mass energy (y/s) of 250 GeV. We analyze this 
process with the Z boson decaying into quark pair (Z —> qq), which is expected to be the most sensitive channel because 
of the high statistics. The cross section of the Higgs-strahlung process at yfs = 250 GeV is about 300 fb [5]. 


1 Talk presented at the International Workshop on Future Linear Colliders (LCWS14), Belgrade, Serbia, 6-10 October 2014. 
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Figure 1: The Feynman diagrams of the Higgs boson production processes, (left): Higgs-strahlung process, (middle): 
WW-fusion process, (right): ZZ -fusion process. 

The diagrams for the main backgrounds which have the same(similar) final state as the signal are shown in Figure [2] 
The e + e~ -+ ZZ —> qqr + r~ process shown in left of Figure [21 is an irreducible background to the signal. Besides, the 
semi-leptonic decay of di-boson processes such as e + e _ —> W + W~ qq'rv shown in Figure [2] right will also to be the 
source of backgrounds due to the similar final states to the signal. 




Figure 2: The example of Feynman diagrams of the main backgrounds. 


3 Simulation Conditions 

We assume a Higgs mass of Mh = 125 GeV, a branching ratio of BR (h — > t + t~) = 6.32 % [ 6 ], an integrated luminosity 
of 250 fb _1 , and beam polarizations of P(e~,e + ) = (—0.8, +0.3). 

We use the Monte-Carlo samples which are prepared for the studies presented in the ILC Technical Design Report [5,7 9] 
(so called TDR sample). Besides, we use the event generator [10] with using GRACE [11,12] and generate Monte-Carlo 
samples of the processes of e + e _ — > ffh with h -+ t + t~ where / denotes a fermion, because the spin correlation of 
h —> t + t~ was not treated properly in TDR samples. Therefore, we do not use the events which contain h —> t + t~ 
processes in TDR samples to avoid double counting. The beam energy spectrum includes the effects due to beamstrahlung 
and the initial state radiation. The beam-induced backgrounds from 77 interactions which give rise to hadrons are included 
in all signal and background processes. The background processes from e + e~ interactions are categorized according to the 
number of final-state fermions: two fermions (2f), three fermions (lf_3f), and four fermions (4f). We also include 77 -+ 2f 
and 77 -+ 4f processes for this study. The detector response is simulated using full simulation based on Geant4 [13]. 
We perform the detector simulation with Mokka [15], a Geant4-based full simulation, with the ILD detector model of 
ILD_ol_v05. TAUOLA [16] is used for the tau decay simulation. The ILD detector model consists of a vertex detector, 
a time projection chamber, an electromagnetic calorimeter, a hadronic calorimeter, a return yoke, muon systems, and 
forward components. 


4 Event Reconstruction 

Even in the signal processes, some hadrons generated via the beam-induced 77 interactions are included. But at the 
yfs = 250 GeV, the numbers of hadrons are estimated to be small [17]. Therefore, we do not apply any special treatment 
to eliminate these hadrons. 

The event reconstruction is consisted of three steps; (1) tau reconstruction, (2) collinear approximation, and (3) Z 
boson reconstruction. 

First, we apply tau finder. Our tau finder searches for charged track which have highest energy among the remaining 
particles, and combines the neighboring particles which satisfy cos0 con e > 0.99, with the combined mass less than 2 
GeV, where 9 cone is the cone angle with respect to the highest energy charged track. The combined object is regarded 
as a tau candidate. Then, we apply the selection cuts to tau candidates as following; discard the candidates which 
categorized into 3-prong with neutral particles, E tau candidate > 3 GeV where E can( j is the energy of tau candidate, and 
satisfy E cone < (0.1 x E canc j) with cos0' one = 0.95 where E cone is the cone energy of a tau candidate with the cone angle 
of 0' cone , and 9' cone is the cone angle with respect to a tau candidate, respectively. These selections are tuned to minimize 
the misidentification of fragments of quark jets as tau decays. After the selection, we apply the charge recovery process 


2 










to obtain better efficiency. The charged particles in a tau candidate which have the energy less than 2 GeV are detached 
one by one, the one with the smallest energy first, until satisfying the following conditions; the charge of a tau candidate 
is exactly equal to ±1, and the number of track(s) in a tau candidate is exactly equal to 1 or 3. The tau candidate after 
the detaching is rejected if it does not satisfy the above conditions, and remaining candidate is regarded as a tau jet. We 
repeat the above processes until there are no charged particles which have the energy greater than 2 GeV. 

After finishing the tau reconstruction, we apply the collinear approximation [18] to reconstruct the invariant mass of 
tau pair system. In this approximation, we assume that the visible decay products of the tau lepton and the neutrino(s) 
from the tau decay is collinear, and the contribution of the missing transverse momentum comes only from the neutrino(s) 
from tau decay. 

After the approximation, we apply the Durham jet clustering [19] with two jets for the remaining objects to reconstruct 
Z boson. 


5 Event Selection 

Before optimizing the cuts, we apply preselection cuts as following; 

• the number of reconstructed t + and t~ are exactly equal to one, 

• the number of reconstructed jet is exactly equal to two, 

• the number of track in an event is greater or equal to nine, 

• M co i > 0 and E co \ > 0 where M co \(E co i) is the invariant mass (energy) of tau pair with collinear approximation in 
the unit of GeV. 

In addition, we apply the following cuts to suppress the trivial backgrounds; 

• 105 < E v i s < 215 and M v ; s > 95, where E v i s (M v i s ) is the visible energy (mass) in the unit of GeV, 

• Pt > 40, where Pt is the sum of the magnitude of the transverse momentum of each particle in the unit of GeV, 

• the thrust in an event should be less than 0.97, 

• 60 < E q q < 175 and 35 < M q q < 160, where E q q(M q g) is the energy (invariant mass) of reconstructed two jets in 
the unit of GeV, 

• cos Oqq < 0.5, where 9 qq is the angle between two jets, 

• E t + t - < 140 and 5 < M T + T - < 125, where E T + T -(AI T + T ~) is the energy (invariant mass) of reconstructed two taus 
without collinear approximation in the unit of GeV, 

• cos0 T + r - < —0.1, where 0 T + T - is the angle between two taus, 

• 30 < E co i < 270 and 15 < M co \ < 240, 

• 65 < Af r ecoii < 185, where M reC oii is the recoil mass against Z boson. 

We only use the events passed the preselection and cuts above for the optimization of event selection. We perform the cut- 
based analysis and multivariate analysis both. The optimization is performed to maximize the signal statistical significance 
of S/y/S + B, where S(B) is the number of signal (background). 

5.1 Cut-based Analysis 

We apply the following cuts sequentially to extract maximum signal significance; 

• E vis < 240 GeV, 

• | cos $ m iss| < 0.98, where 9 m i ss is the angle between missing momentum in an event and beam axis, 

• Eqq < 125 GeV, M qq > 80 GeV, 

• E t+t - < 130 GeV, A'1 T + T - < 115 GeV, cos 9 T+T - < 0.54, 

• E co i < 210 GeV, M col > 100 GeV, 

• log 10 |c?osig(T + )| + log 10 |dosig(r”)| > —0.2, where d 0 sig(r + ) is the impact parameter do (zy-plane) divided by the 
error of do for r + , 

• log 10 |zosig(r + )| + log 10 |2osig(r - )| > —0.2, where Zosig(r + ) is the impact parameter zo (rz-plane) divided by the 
error of zq for r + , 
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• -^recoil >115 GeV. 


Figure [3] shows the M co \ distribution in the sequential cuts. After all cuts above, the signal events of 1002 and the 
background events of 535.4 are remained. The statistical significance is calculated to be S/y/S + B = 25.6 a. This result 
corresponds to the precision of A(cr x BR)/(cr x BR) = 3.9%. 



Figure 3: The distribution of M co \ in the sequential cuts. Black histogram shows the summing up of all processes, blue 
shows the signal process, and red shows the all four-fermion background processes, respectively. 


5.2 Multivariate Analysis 

We use the TMVA package in ROOT [20] and use the Gradient Boosted Decision Tree (BDTG) technique for the analysis 
tool. We use the following 17 variables as the inputs; 

• three variables from the overall event: E v i s , P t (all), and cosd m i ss , where Pj(all) is the magnitude of the transverse 
momentum calculated from the momentum vector of all visible particles in an event, 

• three variables from reconstructed jets and Z boson: M qq , cos 0 qq , and cos 6z where 6z is the angle between the 
momentum of reconstructed Z boson and beam axis, 

• seven variables from reconstructed taus: M T + T -, E T + T -, cos d T + T -, cos d acop , cos 6h , log 10 |dosig(T + )|+log 10 |e?osig(T _ )|, 
log 10 |zosig(T + )| T log 1 Q |zosig(r“)|, where d acop is the acoplanarity angle between two taus, Oh is the angle between 
the momentum of reconstructed Higgs boson without collinear approximation and beam axis, 

• three variables from collinear approximation: M co i, E co i, cosOhcoI where 0h c oi is the angle between reconstructed 
Higgs momentum and beam axis with collinear approximation, 

• one from recoil mass: M leco n. 


Figure 0] shows the response of the multivariate classifier. The number of events surviving the event selection is 1205 for 
the signal and 521.4 for the background. The signal significance is calculated to be S/\/S + B = 29.0er, which implies a 
precision of A(<r x BR)/(er x BR) = 3.4%. The result of the multivariate analysis improved by ~ 13% than the cut-based 
analysis. 
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Figure 4: The multivariate classifier response for qqh mode. The green plot shows the signal significance. 


6 Comparison with Previous Results 

We previously performed the analysis with fully simulated samples at yfs = 250 GeV, and the results are written in 
Ref [21]. In Ref [21], the analysis had been performed with the condition of Higgs mass of 120 GeV, and analyzed three 
signal inodes: qqh, e + e~h 1 and n + fi~h. The combined result with the extrapolation to the Higgs mass of 125 GeV was 
A(<x x BR)/(cr x HR) = 4.2%. The results described in Section 5 are much better than previous study, even the cross 
section and the branching ratio of h —» t + t~ [ 6 ] are worse than the case of Higgs mass of 120 GeV and only using qqh 
mode. These differences mainly come from the mass difference between Z boson and Higgs boson, and the difference of 
analysis technique. Now we can apply tighter cuts in the recoil mass against Z boson than previously, because the peak 
of recoil mass shifted from 120 GeV to 125 GeV. This difference introduces good separation power between signal and 
background. Besides, we only had been performed cut-based analysis in previous. The cut-based analysis is very simple 
and easy to understand, but difficult to get better precision. The multivariate analysis gives us much better results than 
the cut-based, as described on Section 5.2. 

7 Summary 

We evaluated the expected measurement accuracy of the branching ratio A(er x BR)/(<r x BR) of the h —> t + t~ mode at 
y/s = 250 GeV at the ILC with a full simulation of the ILD detector model, assuming Mh = 125 GeV, BR(/i —> t + t~ ) = 
6.32 %, f L dt = 250 fb _1 , and beam polarizations P(e _ ,e + ) = (—0.8,+0.3). We analyzed the qqh final state using 
a cut-based approach and a multivariate approach. As a result, we expected A(cr x BR)/(ct x BR) = 3.4% with using 
Gradient Boosted Decision Tree technique. 
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